An Anaphora Resolution-Based Anonymization Module

نویسندگان

  • Massimo Poesio
  • Mijail A. Kabadjov
  • P. Goux
  • Udo Kruschwitz
  • Elizabeth Bishop
  • Louise Corti
چکیده

Growing privacy and security concerns mean there is an increasing need for data to be anonymized before being publically released. We present a module for anonymizing references implemented as part of the SQUAD tools for specifying and testing non-proprietary means of storing and marking-up data using universal (XML) standards and technologies. The tool is implemented on top of the GUITAR anaphoric resolver.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Anaphora Resolution – What Helps in German

Although anaphora resolution has been a very active research area throughout the last decades, there exist only a few studies that focus on German anaphora. Strube and Hahn (1999) present a system for anaphora resolution for German based on an extension of Centering Theory. Müller et al. (2002) and Kouchnir (2003) use co-training and boosting, respectively. Hinrichs et al. (2005) employ a hybri...

متن کامل

GuiTAR-based Pronominal Anaphora Resolution in Bengali

This paper attempts to use an off-the-shelf anaphora resolution (AR) system for Bengali. The language specific preprocessing modules of GuiTAR (v3.0.3) are identified and suitably designed for Bengali. Anaphora resolution module is also modified or replaced in order to realize different configurations of GuiTAR. Performance of each configuration is evaluated and experiment shows that the off-th...

متن کامل

A Data-driven Approach to Pronominal Anaphora Resolution for German

This paper reports on a hybrid architecture for computational anaphora resolution (CAR) of German that combines a rule-based pre-filtering component with a memory-based resolution module (using the Tilburg Memory Based Learner – TiMBL). The data source is provided by the TüBa-D/Z treebank of German newspaper text (Telljohann et al. 04) that is annotated with anaphoric relations. The CAR experim...

متن کامل

PHORA: A system to solve the Anaphora in Spanish

In this paper we present a whole Natural Language Processing (NLP) system for Spanish. The core of this system is the parser, which uses the grammatical formalism Lexical-Functional Grammars (LFG). Another important component of this system is the anaphora resolution module. To solve the anaphora, this module contains a method based on linguistic information (lexical, morphological, syntactic a...

متن کامل

Anaphora Resolution: To What Extent Does It Help NLP Applications?

Papers discussing anaphora resolution algorithms or systems usually focus on the intrinsic evaluation of the algorithm/system and not on the issue of extrinsic evaluation. In the context of anaphora resolution, extrinsic evaluation concerns the impact of an anaphora resolution module on a larger NLP system of which it is part. In this paper we explore the extent to which the well-known anaphora...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006